Discovering Concepts in Structural Data
Authors
Abstract
The explosive growth of databases in scientific, industrial, and commercial fields has not been accompanied by a similar growth in our ability to analyze and digest this data. The increasing amount and complexity of data creates an urgent need for automatic database analysis tools. This trend is evident in molecular biology data, which continues to grow in both size and complexity. This research outlines a general approach to automatically discover repetitive and functional concepts in large structural databases. The Subdue system discovers substructures that compress the database and represent structural concepts in the data. By replacing previously discovered substructures in the data, multiple passes of Subdue produce a hierarchical description of the structural regularities in the data. To increase the flexibility of the system, we describe methods of incorporating domain-dependent information into the discovery process. Because discovery systems such as Subdue are computationally expensive, we also explore ways of parallelizing the system to improve scalability.
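The idea of scoring a substructure by how much it compresses the data can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function name `compression_value`, the restriction of candidate substructures to a single labeled edge, the greedy non-overlapping matching, and the simple size measure (vertices plus edges) are not taken from Subdue, whose actual evaluation and encoding are not described in this abstract.

```python
# Illustrative sketch only: a size-based stand-in for compression-driven
# substructure evaluation. Not Subdue's actual algorithm or encoding.

def compression_value(vertices, edges, pattern):
    """Score a one-edge pattern by how much it shrinks the graph.

    vertices: dict mapping vertex id -> label
    edges:    list of (u, v, edge_label) tuples, matched in the given direction
    pattern:  (label_u, edge_label, label_v)
    Returns (value, instances); value > 1 means the pattern compresses the graph.
    """
    used = set()       # vertices already claimed by an instance
    instances = 0
    for u, v, label in edges:
        if u == v or u in used or v in used:
            continue   # keep instances non-overlapping
        if (vertices[u], label, vertices[v]) == pattern:
            used.update((u, v))
            instances += 1

    original_size = len(vertices) + len(edges)
    # Each instance collapses two vertices and one edge into a single vertex.
    compressed_size = (len(vertices) - instances) + (len(edges) - instances)
    pattern_size = 3   # two vertices plus one edge
    return original_size / (pattern_size + compressed_size), instances


# Hypothetical toy graph: a chain of three C vertices, each bonded to one H.
toy_vertices = {1: "C", 2: "H", 3: "C", 4: "H", 5: "C", 6: "H"}
toy_edges = [(1, 2, "bond"), (3, 4, "bond"), (5, 6, "bond"),
             (1, 3, "bond"), (3, 5, "bond")]

ch_value, ch_count = compression_value(toy_vertices, toy_edges, ("C", "bond", "H"))
cc_value, cc_count = compression_value(toy_vertices, toy_edges, ("C", "bond", "C"))
print(ch_value, ch_count)
print(cc_value, cc_count)
```

In this toy graph the C-H pattern occurs three times without overlap and scores above 1, while the C-C pattern yields only one non-overlapping instance and scores below 1, so a compression-driven search would prefer the former. Replacing each instance with a single vertex and repeating the search on the reduced graph is one way to picture the multi-pass, hierarchical description mentioned in the abstract.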
Similar resources
Pathology of talent management in urban industries; case study: automotive industries
The purpose of this study is to examine the pathology of talent management in the automotive industry, identifying challenges and barriers as well as success factors in this field. This research is a qualitative study carried out with a coding methodology for qualitative data. We used semi-structured interviews to collect data. After collecting data and coding, data are divided into two groups of...
Discovering Structural Patterns in Telecommunications Data
With the increasing amount and complexity of data being collected, there is an urgent need to create automated techniques for mining the data. In particular, data being generated and stored by telecom companies overwhelms scientists' ability to manually discover patterns in the data. Because much of this data is structural in nature, or composed of parts and relations between the parts, linear ...
Discovering the Underlying Components Affecting the Usability of IoT in Iranian Libraries: A Theory Based on Context
Objective: The aim is to discover the underlying context components of IoT usability in Iranian libraries, using a qualitative approach consistent with grounded theory. Method: This qualitative study was conducted based on grounded theory. Data were collected through semi-structured interviews with 13 faculty members of knowledge and information science based on purposeful and chain methods. Responsi...
Substructure Discovery Using Minimum Description Length and Background Knowledge
The ability to identify interesting and repetitive substructures is an essential component to discovering knowledge in structural data. We describe a new version of our Subdue substructure discovery system based on the minimum description length principle. The Subdue system discovers substructures that compress the original data and represent structural concepts in the data. By replacing previo...
Developing a grounded-based model of tranquility in contemporary apartments in Urmia City
Introduction: Stressful life and lack of tranquility in modern society have been serious problems for human life. Environmental psychology has shown that physical and architectural environments play an important role in this, and since the home is one of the most important environments, they try to offer solutions. This study tries to identify the factors that play an effective role in creatin...
Journal title:
Volume, issue:
Pages: -
Publication date: 1999